In this article we study the dependence degree of the traded volume of the Dow Jones 30 constituent
equities by using a nonextensive generalised form of the Kullback-Leibler information measure.
Our results show a slow decay of the dependence degree as a function of the lag. This feature
is compatible with the existence of non-linearities in this type of time series. In addition, we introduce
a dynamical mechanism whose associated stationary probability density function (PDF) is in
good agreement with the empirical results.

PACS numbers: 05.45.Tp (Time series analysis); 89.65.Gh (Economics, econophysics, financial markets,
business and management); 05.40.-a (Fluctuation phenomena, random processes, noise and Brownian
motion).

I. INTRODUCTION



The study of complexity, in particular within financial systems, has become one of the main focuses of interest in
statistical physics. In fact, several statistical properties verified in financial observables, e.g., relative price changes
(the return) and the standard deviation of returns (the volatility), have enabled the establishment of new models which
characterise these systems ever better. Along with the previous two quantities, another key observable is the number
of stocks of a certain company traded in a given period of time, the traded volume, $v$. In this article we analyse
the dependence degree of 1-minute traded volume time series, $V(t)$, of the constituents of the Dow Jones Industrial
Average 30 index (DJ30), between the 1st of July 2004 and the 31st of December 2004. We also introduce a dynamical
mechanism that provides the same stationary PDF. In order to avoid spurious features, we have removed the
intra-day pattern of the original time series and normalised each element of the series by its mean value, defining the
normalised traded volume,

$$v(t) = \frac{V'(t)}{\langle V'(t) \rangle}, \qquad V'(t) = \frac{V(t)}{\Xi(t')}, \qquad \Xi(t') = \frac{1}{N} \sum_{i=1}^{N} V(t'_i),$$

where $\langle \cdots \rangle$ is defined as the average over time ($t'$ represents the intra-day time and $i$ the day).
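As a rough illustration of this normalisation, the following Python sketch assumes that the intra-day pattern is removed by dividing each 1-minute volume by the average volume observed at the same intra-day minute over all days, and that the result is then divided by its time average. The function name `normalise_volume` and the `(day, minute, volume)` record layout are hypothetical choices made for this example only.

```python
from collections import defaultdict


def normalise_volume(records):
    """Return normalised volumes for a list of (day, minute, volume) records.

    Assumes strictly positive volumes, so that no intra-day mean vanishes.
    """
    # Xi(t'): mean volume at each intra-day minute t', averaged over the days.
    totals = defaultdict(float)
    counts = defaultdict(int)
    for _day, minute, volume in records:
        totals[minute] += volume
        counts[minute] += 1
    xi = {minute: totals[minute] / counts[minute] for minute in totals}

    # V'(t) = V(t) / Xi(t') removes the intra-day pattern.
    detrended = [volume / xi[minute] for _day, minute, volume in records]

    # v(t) = V'(t) / <V'(t)> yields a series with unit mean.
    mean = sum(detrended) / len(detrended)
    return [value / mean for value in detrended]


# Two days, two intra-day minutes each (toy numbers, not market data).
records = [(0, 0, 10.0), (0, 1, 20.0), (1, 0, 30.0), (1, 1, 60.0)]
v = normalise_volume(records)
```

By construction the returned series has unit mean, so series from different equities become directly comparable.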



II. DEPENDENCE DEGREE 

Discrimination between two hypotheses, i.e. consistent testing, is ubiquitous in science. Examples are the
stationary/non-stationary character of a time series or the dependence degree between its elements. Concerning the
latter, the most widely applied measure of "dependence" between variables is the correlation function, mathematically
defined for a lag $\tau$ as

$$C[v(t), v(t+\tau)] = \frac{\langle v(t)\, v(t+\tau) \rangle - \langle v(t) \rangle^{2}}{\langle v(t)^{2} \rangle - \langle v(t) \rangle^{2}}.$$

Since the correlation function is basically a normalised covariance (or the second cumulant of the stochastic process),
it is only a suitable statistical procedure for linear correlations or correlations that can be written in a linear
way. In other words, the correlation function cannot conveniently detect non-linearities in a given group
of data. Aiming to consistently test the dependence or independence of stochastic variables, a dependence measure
was recently defined that has been able to evaluate non-linearities, for instance, in daily return time series [6] and
GARCH processes [7], for which the correlation function gives a zero value.
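A minimal numerical illustration of this limitation, not taken from the data analysed here: for a variable $x$ that is symmetric around zero and $z = x^2$, the two variables are fully dependent, yet the normalised covariance vanishes. The sketch below uses the ordinary two-sample (Pearson) form of the correlation; the function name `correlation` is chosen for this example.

```python
def correlation(x, z):
    """Normalised covariance (Pearson correlation) of two equal-length samples."""
    n = len(x)
    mean_x = sum(x) / n
    mean_z = sum(z) / n
    cov = sum((a - mean_x) * (b - mean_z) for a, b in zip(x, z)) / n
    var_x = sum((a - mean_x) ** 2 for a in x) / n
    var_z = sum((b - mean_z) ** 2 for b in z) / n
    return cov / (var_x * var_z) ** 0.5


# Symmetric grid on [-1, 1]; z is a deterministic (non-linear) function of x.
x = [i / 100.0 for i in range(-100, 101)]
z = [a * a for a in x]

r_nonlinear = correlation(x, z)                     # vanishes up to rounding
r_linear = correlation(x, [2 * a + 1 for a in x])   # equals 1 up to rounding
```

A measure built on the full joint probability, rather than on second moments only, is needed to separate these two cases.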

FIG. 1: Left: Normalised generalised Kullback-Leibler measure, $R_{q'}$, vs. entropic index, $q'$, for International Business
Machines (IBM). The inset shows, as mere illustration, the derivative of $R_{q'}$ with respect to $q'$ for $\tau = 1$. The maximum corresponds
to $q^{op} = 1.58$. Right: The symbols represent the dependence degree, $q^{op}$, vs. $\tau$ (in minutes) averaged over the 30 time series.
The line represents a logarithmic fit, $q^{op} = 1.59 + 0.11 \log(\tau)$ (the correlation coefficient is 0.9944), highlighting
the slow increase of $q^{op}$.



So, let us start by defining our dependence measure as the non-extensive generalised mutual information measure,

$$I_{q'} = -\int p(y)\, \ln_{q'} \frac{p'(y)}{p(y)}\, dy,$$

where $\ln_{q'}(y) = \frac{y^{1-q'} - 1}{1 - q'}$ ($\ln_{1}(y) = \ln(y)$), which emerged within the non-extensive formalism based on Tsallis
entropy [8]. For $q' = 1$, it is equivalent to the Kullback-Leibler information gain.

Let us now assume that $y$ is a two-dimensional random variable, $y = (x, z)$. We can quantify the degree of dependence
between $x$ and $z$ by computing $I_{q'}$ for $p(x, z)$ and $p'(x, z) = p_1(x)\, p_2(z)$, where $p_1$ and $p_2$ represent the marginal
probabilities. For this case, $I_{q'}$ presents both a lower bound and an upper bound. The former, $I_{q'}^{MIN} = 0$, corresponds
to total independence between $x$ and $z$, i.e. $p(x, z) = p'(x, z)$. The latter, $I_{q'}^{MAX}$, represents a one-to-one dependence
between the variables and is given by

$$I_{q'}^{MAX} = -\iint p(x, z) \left[ \ln_{q'} p_1(x) + (1 - q')\, \ln_{q'} p_1(x)\, \ln_{q'} p_2(z) \right] dx\, dz.$$

From these two extreme values, it is then possible to define a normalised measure,

$$R_{q'} = \frac{I_{q'}}{I_{q'}^{MAX}} \in [0, 1],$$

which has an optimal index, $q^{op}$ (where the prime was suppressed for clarity).

This index is optimal in the sense that the gradient of the measure $R_{q'}$ is most sensitive there and hence most capable
of detecting variations in the dependence among the variables. Moreover, it is optimal because its two extreme
values are associated with full dependence and full independence between $x$ and $z$. Analytically, it is determined by the
inflection point of the $R_{q'}$ vs. $q'$ curves. For one-to-one dependence we have $q^{op} = 0$, and $q^{op} = \infty$ for total independence
(see reference [3] for a detailed discussion).
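Two of the properties of $I_{q'}$ stated above, the lower bound $I_{q'} = 0$ for independent variables and the Kullback-Leibler limit at $q' = 1$, can be checked on small discrete probability tables. The sketch below is a discrete analogue written for illustration (sums replace integrals); the helper names and the two toy joint tables are assumptions of this example, and the normalisation by $I_{q'}^{MAX}$ is deliberately left out.

```python
import math


def ln_q(y, q):
    """q-logarithm (y**(1-q) - 1) / (1 - q), reducing to ln(y) at q = 1."""
    if abs(q - 1.0) < 1e-12:
        return math.log(y)
    return (y ** (1.0 - q) - 1.0) / (1.0 - q)


def info_gain(p, p_ref, q):
    """Discrete I_q = -sum_i p_i ln_q(p_ref_i / p_i); assumes p_ref_i > 0 where p_i > 0."""
    return -sum(pi * ln_q(ri / pi, q) for pi, ri in zip(p, p_ref) if pi > 0.0)


def flatten(joint):
    """Row-major list of the entries of a joint table joint[x][z]."""
    return [value for row in joint for value in row]


def product_of_marginals(joint):
    """Row-major list of p1(x) * p2(z) built from the marginals of joint[x][z]."""
    p1 = [sum(row) for row in joint]
    p2 = [sum(col) for col in zip(*joint)]
    return [a * b for a in p1 for b in p2]


dependent = [[0.4, 0.1], [0.1, 0.4]]          # toy joint table with dependence
independent = [[0.25, 0.25], [0.25, 0.25]]    # toy joint table, p = p1 * p2

i_dep = info_gain(flatten(dependent), product_of_marginals(dependent), q=1.5)
i_ind = info_gain(flatten(independent), product_of_marginals(independent), q=1.5)
```

Here `i_ind` vanishes because every ratio inside the q-logarithm equals one, while `i_dep` is strictly positive.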

We have computed $R_{q'}$ for all time series with $x = v(t)$ and $z = v(t + \tau)$, where $\tau$ represents the lag. A typical example
is presented in Fig. 1 (left panel). Analysing the behaviour of $q^{op}$ as a function of $\tau$, we have observed a slow increase
of $q^{op}$, i.e., a slow decrease in the dependence degree between variables, as is visible in Fig. 1 (right panel). Our result
reveals the existence of significant non-linear dependences which seem to be present even for large times. In Fig. 2 it
is possible to see that the correlation value diminishes by around 80% between $\tau = 1$ and $\tau = 1000$, while the $q^{op}$ value
changes by only about 20% between $\tau = 1$ and $\tau = 1024$, i.e., the dependence degree decreases by the
same amount.
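Taking the logarithmic fit quoted in the caption of Fig. 1, $q^{op} = 1.59 + 0.11 \log(\tau)$, the size of the change in $q^{op}$ between $\tau = 1$ and $\tau = 1000$ can be checked directly. The base of the logarithm is not stated in the text; the sketch below evaluates both a base-10 and a natural logarithm, each as an assumption.

```python
import math


def q_op_fit(tau, log=math.log10):
    """Fit from the Fig. 1 caption; `log` selects the assumed base of the logarithm."""
    return 1.59 + 0.11 * log(tau)


# Relative change of q_op between tau = 1 and tau = 1000 minutes.
change_base10 = q_op_fit(1000.0) / q_op_fit(1.0) - 1.0
change_natural = q_op_fit(1000.0, math.log) / q_op_fit(1.0, math.log) - 1.0
```

Under these assumptions the base-10 reading gives a change of about 21%, in line with the roughly 20% quoted above, whereas the natural-logarithm reading would give about 48%.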

FIG. 2: Left: Symbols represent the average correlation function for the 30 time series analysed and the line represents a double
exponential fit, Eq. (5), with characteristic times $\gamma^{-1} = 13$ and $T = 332$, yielding a ratio of about 25 between the two time scales
($R^2 = 0.991$ and $\chi^2 = 9 \times 10^{-6}$; time in minutes).



TABLE I: Values obtained from PDF fitting ($q$, $\theta$ and $\alpha$) and from correlation analysis ($\gamma T$).

Equity    q       θ       α      γT
…         …      9.20    2.32    …
DD        …      7.33    2.26    …
…        1.21    7.29    2.19    20
…        1.17    8.31    2.75    33
GM       1.21    8.14    2.46    20
HD       1.17    8.76    2.84    27
HON      1.19    9.06    2.67    70
HPQ      1.19    8.55    2.64    28
IBM      1.14   12.36    3.70    41
INTC     1.20    4.22    1.70    25
JNJ      1.17    8.55    2.91    11
JPM      1.17    9.14    2.92    22
KO       1.19    7.88    2.61    26
MCD      1.21    7.48    2.30    30
MMM      1.19    7.14    2.33    23
MO       1.18    7.73    2.66    12
MRK      1.25    1.24    0.61    21
MSFT     1.22     …      1.62    23
PFE      1.18    6.31    2.44    33
…        1.16    8.94    2.99    23
…        1.19    8.62    2.57    25
UTX      1.14   18.47    4.71    32
VZ       1.17    8.83    2.84    34
WMT      1.16   10.24    3.23    30
XOM      1.15   11.45    3.50    31



III. A POSSIBLE DYNAMICAL MODEL FOR TRADED VOLUMES 



The non-linear character of a time series manifests itself in the (asymptotic) power-law behaviour of the
(stationary) PDF of the stochastic variable. This power-law-like behaviour of the PDF was also verified for traded volume
time series. In order to describe a possible dynamical mechanism for this observable, let us suppose that the
traded volume of an equity is described by the following stochastic differential equation,

$$dv = -\gamma \left( v - \frac{\omega}{\alpha} \right) dt + \sqrt{\frac{2 \gamma}{\alpha}}\, v\, dW_t, \qquad (1)$$



where $W_t$ is a regular Wiener process following a normal distribution and $v > 0$. The right-hand side of Eq. (1) may
be interpreted as follows: the deterministic term represents a natural mechanism of the system which aims to keep the
traded volume at some "normal" value, $\omega/\alpha$, with a relaxation time of the order of $\gamma^{-1}$. The stochastic term mimics the
microscopic effects on the evolution of $v$, just like a multiplicative noise used to replicate intermittent processes. This
dynamics, through the corresponding Fokker-Planck equation [11], leads to an inverted Gamma stationary distribution,

$$f(v) = \frac{1}{\omega\, \Gamma[\alpha + 1]} \left( \frac{v}{\omega} \right)^{-\alpha - 2} \exp\left[ -\frac{\omega}{v} \right]. \qquad (2)$$
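The link between the stochastic differential equation and the inverted Gamma distribution can be explored numerically. The sketch below assumes the form $dv = -\gamma (v - \omega/\alpha)\, dt + \sqrt{2\gamma/\alpha}\, v\, dW_t$ for Eq. (1) with constant $\omega$, integrates it with a plain Euler-Maruyama scheme, and evaluates the inverted Gamma PDF $f(v) = \frac{1}{\omega \Gamma[\alpha+1]} (v/\omega)^{-\alpha-2} e^{-\omega/v}$, whose mean is $\omega/\alpha$. The parameter values ($\alpha = 4$, $\omega = 1$, $\gamma = 1$), the time step and the function names are illustrative assumptions, not values from the paper.

```python
import math
import random


def simulate_volume(gamma, alpha, omega, dt, n_steps, seed=0):
    """Euler-Maruyama path of dv = -gamma (v - omega/alpha) dt + sqrt(2 gamma/alpha) v dW."""
    rng = random.Random(seed)
    noise_amp = math.sqrt(2.0 * gamma / alpha * dt)   # includes the sqrt(dt) of dW
    v = omega / alpha                                  # start at the "normal" value
    path = []
    for _ in range(n_steps):
        xi = rng.gauss(0.0, 1.0)
        v += -gamma * (v - omega / alpha) * dt + noise_amp * v * xi
        path.append(v)
    return path


def inverted_gamma_pdf(v, alpha, omega):
    """Inverted Gamma PDF, evaluated through logarithms for numerical stability."""
    log_f = (-math.log(omega) - math.lgamma(alpha + 1.0)
             - (alpha + 2.0) * math.log(v / omega) - omega / v)
    return math.exp(log_f)


path = simulate_volume(gamma=1.0, alpha=4.0, omega=1.0, dt=0.01, n_steps=200_000)
sample_mean = sum(path) / len(path)   # expected to lie close to omega / alpha
```

A histogram of `path` can then be compared with `inverted_gamma_pdf`; a smaller time step reduces the discretisation error of the multiplicative noise term.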



Consider now, along the lines of Beck and Cohen superstatistics, that instead of being constant, $\omega$ is a time-dependent
quantity which evolves on a time scale $T$ larger than the time scale $\gamma^{-1}$ required by Eq. (1) to reach stationarity.
This time dependence is, in the present model, associated with changes in the volume of activity (number of traders
that performed transactions). Furthermore, if we assume that $\omega$ follows a Gamma PDF,

$$P(\omega) = \frac{1}{\lambda\, \Gamma[\delta]} \left( \frac{\omega}{\lambda} \right)^{\delta - 1} \exp\left[ -\frac{\omega}{\lambda} \right], \qquad (3)$$

the long-term distribution of $v$ will be given by $p(v) = \int f(v)\, P(\omega)\, d\omega$, which yields

$$p(v) = \frac{1}{Z} \left( \frac{v}{\theta} \right)^{-\alpha - 2} \exp_q\left[ -\frac{\theta}{v} \right], \qquad (4)$$

where $\lambda = \theta\, (q - 1)$, $\delta = \frac{1}{q - 1} - \alpha - 1$, and $\exp_q[x] = [1 + (1 - q)\, x]^{\frac{1}{1 - q}}$ is the $q$-exponential function, the inverse
function of $\ln_q(y)$ ($\exp_1[x] = e^x$) [8], $Z$ being the normalisation constant.
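The superstatistical average can be verified numerically. The sketch below integrates the product of an inverted Gamma PDF in $v$ (conditional on $\omega$) and a Gamma PDF in $\omega$ with Simpson's rule, and compares the result with the closed form $(1/Z)(v/\theta)^{-\alpha-2} \exp_q[-\theta/v]$, using $\lambda = \theta (q-1)$ and $\delta = 1/(q-1) - \alpha - 1$. The explicit normalisation $1/Z = \Gamma[1/(q-1)]\, (q-1)^{\alpha+1} / (\theta\, \Gamma[\alpha+1]\, \Gamma[\delta])$ is obtained here by carrying out the integral and is not quoted from the text; the parameter values are the averages reported for the fitted series, and all function names are hypothetical.

```python
import math


def exp_q(x, q):
    """q-exponential [1 + (1 - q) x]^(1/(1 - q)); assumes 1 + (1 - q) x > 0."""
    if abs(q - 1.0) < 1e-12:
        return math.exp(x)
    return (1.0 + (1.0 - q) * x) ** (1.0 / (1.0 - q))


def inverted_gamma(v, alpha, omega):
    """Inverted Gamma PDF of v for a fixed omega."""
    return math.exp(-math.log(omega) - math.lgamma(alpha + 1.0)
                    - (alpha + 2.0) * math.log(v / omega) - omega / v)


def gamma_pdf(omega, delta, lam):
    """Gamma PDF of omega with shape delta and scale lam."""
    return math.exp(-math.log(lam) - math.lgamma(delta)
                    + (delta - 1.0) * math.log(omega / lam) - omega / lam)


def mixture_numeric(v, alpha, delta, lam, n=4000):
    """Simpson estimate of the integral of f(v | omega) P(omega) over omega (n even)."""
    upper = 60.0 / (1.0 / v + 1.0 / lam)   # integrand is negligible beyond this point
    h = upper / n
    total = inverted_gamma(v, alpha, upper) * gamma_pdf(upper, delta, lam)
    for k in range(1, n):                  # the endpoint omega = 0 contributes zero
        omega = k * h
        weight = 4.0 if k % 2 else 2.0
        total += weight * inverted_gamma(v, alpha, omega) * gamma_pdf(omega, delta, lam)
    return total * h / 3.0


def mixture_closed(v, q, theta, alpha):
    """Closed form (1/Z) (v/theta)^(-alpha-2) exp_q(-theta/v) with the derived 1/Z."""
    delta = 1.0 / (q - 1.0) - alpha - 1.0
    log_inv_z = (math.lgamma(1.0 / (q - 1.0)) + (alpha + 1.0) * math.log(q - 1.0)
                 - math.log(theta) - math.lgamma(alpha + 1.0) - math.lgamma(delta))
    return math.exp(log_inv_z) * (v / theta) ** (-alpha - 2.0) * exp_q(-theta / v, q)


q, theta, alpha = 1.19, 8.31, 2.63
lam = theta * (q - 1.0)
delta = 1.0 / (q - 1.0) - alpha - 1.0
numeric = mixture_numeric(8.0, alpha, delta, lam)
closed = mixture_closed(8.0, q, theta, alpha)
```

The two numbers agree to within the accuracy of the quadrature, which supports the stated relations between $(\lambda, \delta)$ and $(q, \theta)$.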

This approach is probabilistically equivalent to the one in [12], but it is more realistic concerning the dependence
on $v$ of the Kramers-Moyal moments. In other words, this model is, in principle, a better dynamical approach.
Regarding the measured values of $q$, $\theta$ and $\alpha$ in Tab. I, we verify that the $q$ values are enclosed within a small
interval, $1.19 \pm 0.02$ (close to 6/5), whereas the other parameters present wider intervals, $\alpha = 2.63 \pm 0.48$ and
$\theta = 8.31 \pm 1.86$. In Fig. 3 we present the best (Pfizer, PFE) and the worst (Du Pont, DD) fits.

With the $\alpha$, $\theta$ and $q$ fitting values in Tab. I we have generated a set of time series aiming to test the validity of our
approach. For the evaluation of the time scales $\gamma^{-1}$ and $T$, we have considered the simplest approach, i.e., the ratio
between the two time scales which describe the correlation function (CF) of the traded volume. See the values of $\gamma T$ for each
equity in Tab. I. As can be seen from Fig. 2, there is a fast decay of the CF, related to local equilibrium, and then a much
slower decay for larger times that is due to a slow decay of correlations in $\omega$, i.e.,

$$C[v(t), v(t + \tau)] = C_1\, e^{-\gamma \tau} + C_2\, e^{-\tau/T}. \qquad (5)$$



This slow decay is consistent with a slow dynamics of $\omega$, a necessary condition for the application of a superstatistical
model. In our numerical calculations we have defined time in units of $\gamma^{-1}$, so that $\gamma^{-1} = 1$. The $\omega$ values used to mimic
the time series were obtained from stationary Feller processes with a relaxation time $T_i$ for each equity $i$ (see the specific
values of $\gamma T$ in Tab. I). From Fig. 3 we observe that our dynamical proposal, using this simple approach,
is able to provide a good probabilistic description of the data.
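The generation procedure described above can be sketched as follows. The text only states that $\omega$ comes from a stationary Feller process with relaxation time $T$; the specific square-root form used below, $d\omega = -(\omega - \delta\lambda)\, dt / T + \sqrt{2 \lambda \omega / T}\, dW_t$, is an assumption chosen because its stationary PDF is a Gamma distribution with shape $\delta$ and scale $\lambda$. Time is measured in units of $\gamma^{-1}$, the values of $q$, $\theta$ and $\alpha$ are the averages quoted in the text, $\gamma T = 25$ is merely illustrative, and the Euler-Maruyama scheme with a positivity guard is a simplification.

```python
import math
import random


def simulate_superstatistical(q, theta, alpha, gamma_T, dt, n_steps, seed=0):
    """Euler-Maruyama paths of (v, omega); time is measured in units of 1/gamma."""
    rng = random.Random(seed)
    lam = theta * (q - 1.0)                  # scale of the Gamma PDF of omega
    delta = 1.0 / (q - 1.0) - alpha - 1.0    # shape of the Gamma PDF of omega
    mean_omega = delta * lam
    omega = mean_omega
    v = omega / alpha
    sqrt_dt = math.sqrt(dt)
    v_path, omega_path = [], []
    for _ in range(n_steps):
        # Assumed Feller (square-root) dynamics for omega with relaxation time gamma_T.
        d_omega = (-(omega - mean_omega) / gamma_T * dt
                   + math.sqrt(2.0 * lam * omega / gamma_T) * sqrt_dt * rng.gauss(0.0, 1.0))
        # Volume dynamics with gamma = 1, driven by the current value of omega.
        d_v = (-(v - omega / alpha) * dt
               + math.sqrt(2.0 / alpha) * v * sqrt_dt * rng.gauss(0.0, 1.0))
        omega = max(omega + d_omega, 1e-12)  # crude guard against negative omega
        v += d_v
        v_path.append(v)
        omega_path.append(omega)
    return v_path, omega_path


v_path, omega_path = simulate_superstatistical(
    q=1.19, theta=8.31, alpha=2.63, gamma_T=25.0, dt=0.01, n_steps=100_000)
```

For a quantitative comparison with an empirical PDF the run should cover many multiples of $T$, since $\omega$ decorrelates only on that slow time scale.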



IV. FINAL REMARKS 



In this article we have analysed some statistical properties of the traded volume of the equities that constitute the DJ30
index, namely the dependence degree between time series elements and the stationary PDF. For the dependence degree
we have used a non-extensive generalised Kullback-Leibler information measure. With this procedure we have studied
the dependence between variables, which decreases logarithmically with the lag. We have also verified that this
decrease of the dependence is much slower than the one observed in the correlation function. This fact indicates that
non-linearities are present in traded volume dynamics and that they may be important factors in other statistical
features such as multi-fractality. Analysing the stationary distribution, we have verified that it is well fitted by a
$q$-generalised inverted Gamma distribution presenting a $q$ value around 6/5 for all series. In addition, we developed a
dynamical mechanism which has the $q$-generalised inverted Gamma distribution as its stationary PDF. Further
developments of this model may be achieved using perturbative calculus for a more accurate determination of $\gamma$ [10] and
determination of the ratio between the scale of local relaxation and the mean traded volume update [17].

FIG. 3: Left: (Upper panel) Excerpt from the analysed Pfizer time series; (Lower panel) Excerpt from the time series generated
to mimic Pfizer using the values presented in Tab. I ($t$ in minutes). Right: Symbols represent the empirical PDF for Pfizer
(shifted by a factor of 10) and Du Pont normalised traded volume time series, which correspond to the best ($R^2 = 0.9953$ and
$\chi^2 = 0.0002$) and worst ($R^2 = 0.9763$ and $\chi^2 = 0.001$) fits, respectively. The lines correspond to simulations using the values
presented in Tab. I.